A topic classification system based on parametric trajectory mixture models

Authors

  • William Belfield
  • Herbert Gish
Abstract

In this paper we address the problem of topic classification of speech data. Our concern is the situation in which no speech or phoneme recognizer is available for the domain of the speech data, so the only inputs for training the system are audio speech files labeled according to the topics of interest. We develop the topic classifier by segmenting the data and then representing the segments with polynomial trajectory models. Clustering acoustically similar segments enables us to train a trajectory Gaussian mixture model, which is used to label segments of both on-topic and off-topic data; the labeled data in turn enables us to create topic classifiers. The advantage of this approach is that it is language and domain independent. We evaluated the performance of our approach with several classifiers, and it demonstrated positive results.
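
The abstract does not give the form of the polynomial trajectory model, so the following is only a minimal sketch of one common way to represent a segment by such a model: a least-squares polynomial fit over normalized time. The function name `fit_trajectory`, the `order` parameter, and the choice of normalizing time to [0, 1] are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np


def fit_trajectory(segment, order=2):
    """Least-squares polynomial trajectory fit for one segment (illustrative).

    segment: (N, D) array holding N frames of D-dimensional features.
    Returns (B, sigma): the (order + 1, D) polynomial coefficients and the
    (D, D) covariance of the residual around the fitted trajectory.
    """
    segment = np.asarray(segment, dtype=float)
    n_frames = segment.shape[0]
    # Assumption: time is normalized to [0, 1] so that segments of
    # different durations share the same parameterization.
    t = np.linspace(0.0, 1.0, n_frames)
    # Design matrix with columns 1, t, t^2, ..., t^order.
    Z = np.vander(t, order + 1, increasing=True)
    # Solve min_B ||Z B - segment||^2 for all feature dimensions at once.
    B, *_ = np.linalg.lstsq(Z, segment, rcond=None)
    residual = segment - Z @ B
    sigma = residual.T @ residual / n_frames
    return B, sigma
```

Under these assumptions, each segment is summarized by the pair `(B, sigma)`, and such summaries are the kind of input that a clustering step and a trajectory mixture model could operate on.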

Similar resources

Parametric trajectory models for speech recognition

The basic motivation for employing trajectory models for speech recognition is that sequences of speech features are statistically dependent and that the effective and efficient modeling of the speech process will incorporate this dependency. In our previous work [1] we presented an approach to modeling the speech process with trajectories. In this paper we continue our development of parametric t...

Lane Change Trajectory Model Considering the Driver Effects Based on MANFIS

The lane change maneuver is among the most popular driving behaviors. It is also the basic element of important maneuvers like overtaking maneuver. Therefore, it is chosen as the focus of this study and novel multi-input multi-output adaptive neuro-fuzzy inference system models (MANFIS) are proposed for this behavior. These models are able to simulate and predict the future behavior of a Dri...

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researchers to use...

Parametric Lorenz Curves and its Relationship with Reliability Indicators

The problem of poverty cannot be considered without paying attention to the topic of income distribution. In fact, the concept of poverty refers to the lowest class in the distribution of income, and inequality in the income distribution is related to all stages of society. Based on the income distributions that are unfairly priced, many indices have been developed to measure the size of inequality...

Analyzing the performance of different machine learning methods in determining the transportation mode using trajectory data

With the widespread advent of smartphones equipped with the Global Positioning System (GPS), a huge volume of users’ trajectory data has been generated. To facilitate urban management and present appropriate services to users, studying these data emerged as a widespread research field and has been developing since then. In this research, the transportation mode of users’ trajectories was identi...

Publication date: 2003